The replication archive contains the following files:

analysis.R = an R script that reproduces the results reported in the paper.

ccdfs_both.csv = data file for all threads (both directly commented and not) that could be classified by veracity.  The file includes the following variables:

	- top_level_post = an identifier for the Reddit thread
	- veracity = the veracity as established by the fact-checking comment
	- num_commenters = total number of unique commenters on the thread
	- num_comments_in_thread = total number of comments on the thread
	- max_depth_comments = maximum number of steps away from the top-level post of a comment
	- post_life_hours = difference in time between the time of the first comment's creation and the last comment's creation
	- directly_commented = an indicator for whether the thread received a direct fact-checking comment
	- is_deleted = an indicator for whether the text of the post had been deleted by the time of data collection
	- is_removed = an indicator for whether the text of the post had been removed by the time of data collection

ccdfs_directly_commented.csv = data file for only threads that were directly commented and could be classified by veracity.  The file includes the following variables:

	- top_level_post = an identifier for the Reddit thread
	- veracity = the veracity as established by the fact-checking comment
	- num_commenters = total number of unique commenters on the thread
	- num_comments_in_thread = total number of comments on the thread
	- max_depth_comments = maximum number of steps away from the top-level post of a comment
	- post_life_hours = difference in time between the time of the first comment's creation and the last comment's creation
	- score = the score (upvotes minus downvotes) of the post at the time of data collection
	- time_to_first_factcheck = difference in time between the post's creation and the first fact-checking comment on the thread

ccdfs_same_url.csv = data file for only threads that had not been directly commented that could be classified by veracity.  The file includes the following variables:

	- top_level_post = an identifier for the Reddit thread
	- veracity = the veracity as established by the fact-checking comment
	- num_commenters = total number of unique commenters on the thread
	- num_comments_in_thread = total number of comments on the thread
	- max_depth_comments = maximum number of steps away from the top-level post of a comment
	- post_life_hours = difference in time between the time of the first comment's creation and the last comment's creation

comments_linking_factcheckers.csv = data file for comments that had linked directly to fact-checkers:

	- id = an identifier for the Reddit comment
	- veracity = the veracity as established by the fact-checking comment
	- score = the score (upvotes minus downvotes) of the comment at the time of data collection



